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(57) Abstract 



A video encoding method and apparatus for adapting a video input to a bandwidth of a transmission channel of a network that 
includes determining the number N enhancement layer bitstreams capable of being adapted to the bandwidth of the transmission channel 
of a network. A base layer bitstream is encoded from the video input wherein a plurality of enhancement layer bitstreams are encoded 
from the video input. The enhancement layer bitstreams are based on the base layer bitstream, wherein the plurality of enhancement layer 
bitstreams complements the base layer bitstream and the base layer bitstream and N enhancement layer bitstreams are transmitted to the 
network. 
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SCALABLE VIDEO CODING AND DECODING 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates to a method and apparatus for the scaling of data 
signals the bandwidth of the transmission channel; and more particularly to a scalable 
video method and apparatus for coding video such that the received video is adapted 
to the bandwidth of the transmission channel. 

Description of Related Art 

Signal compression in the video arena has long been employed to increase the 
bandwidth of either the generating, transmitting, or receiving device. MPEG - an 
acronym for Moving Picture Experts Group - refers to the family of digital video 
compression standards and file formats developed by the group. For instance, the 
MPEG-1 video sequence is an ordered stream of bits, with special bit patterns marking 
the beginning and ending of a logical section. 

MPEG achieves high compression rate by storing only the changes from one 
frame to another, instead of each entire frame. The video information is then encoded 
using a technique called DCT (Discrete Cosine Transform) which is a technique for 
representing a waveform data as a weighted sum of cosines. MPEG use a type of 
lossy compression wherein some data is removed. But the diminishment of data is 
generally imperceptible to the human eye. It should be noted that the DCT itself does 
not lose data; rather, data compression technologies that rely on DCT approximate 
some of the coefficients to reduce the amount of data. 

The basic idea behind MPEG video compression is to remove spatial 
redundancy within a video frame and temporal redundancy between video frames. The 
DCT-based (Discrete Cosine Transform) compression is used to reduce spatial 
redundancy and motion compensation is used to exploit temporal redundancy. The 
images in a video stream usually do not change much within small time intervals. 
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Thus, the idea of motion-compensation is to encode a video frame based on other 
video frames temporally close to it. 

A video stream is a sequence of video frames, each frame being a still image. 
A video player displays one frame after another, usually at a rate close to 30 frames per 
second. Macroblocks are formed, each macroblock consists of four 8x8 luminance 
blocks and two 8x8 chrominance blocks. Macroblocks are the units for 
motion-compensated compression, wherein blocks are basic unit used for DCT 
compression. Frames can be encoded in three types: intra-frames (I-frames), forward 
predicted frames (P-frames), and bi-directional predicted frames (B-frames). 

An I-frame is encoded as a single image, with no reference to any past or future 
frames. Each 8x8 block is encoded independently, except that the coefficient in the 
upper left corner of the block, called the DC coefficient, is encoded relative to the DC 
coefficient of the previous block. The block is first transformed from the spatial 
domain into a frequency domain using the DCT (Discrete Cosine Transform), which 
separates the signal into independent frequency bands. Most frequency information is 
in the upper left corner of the resulting 8x8 block. After the DCT coefficients are 
produced the data is quantized, i.e. divided or separated. Quantization can be thought 
of as ignoring lower-order bits and is the only lossy part of the whole compression 
process other than sub-sampling. 

The resulting data is then run-length encoded in a zig-zag ordering to optimize 
compression. The zig-zag ordering produces longer runs of O's by taking advantage of 
the fact that there should be little high-frequency information (more O's as one zig-zags 
from the upper left corner towards the lower right comer of the 8x8 block). 

A P-frame is encoded relative to the past reference frame. A reference frame is 
a P- or I-frame. The past reference frame is the closest preceding reference frame. A 
P-macroblock is encoded as a 1 6 x 16 area of the past reference frame, plus an error 
term. 

To specify the 16 x 16 area of the reference frame, a motion vector is included. 
A motion vector (0, 0) means that the 1 6 x 1 6 area is in the same position as the 
macroblock we are encoding. Other motion vectors are generated are relative to that 
position. Motion vectors may include half-pixel values, in which case pixels are 
averaged. The error term is encoded using the DCT, quantization, and run-length 
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encoding. A macroblock may also be skipped which is equivalent to a (0, 0) vector 
and an all-zero error term. 

A B-frame is encoded relative to the past reference frame, the future reference 
frame, or both frames. 

A pictorial view of the above processes and techniques in application sire 
depicted in prior art Fig. 15, which illustrates the decoding process for a SNR 
scalability. Scalable video coding means coding video in such a way that the quality of 
a received video is adapted to the bandwidth of the transmission channel. Such a 
coding technique is very desirable for transmitting video over a network with a time- 
varying bandwidth. 

SNR scalability defines a mechanism to refine the DCT coefficients encoded in 
another (lower) layer of a scalable hierarchy. As illustrated in prior art Fig. 15, data 
from two bitstreams is combined after the inverse quantization processes by adding 
the DCT coefficients, Until the dat is combined, the decoding processes of the two 
layers are independent of each other. 

The lower layer (base layer) is derived from the first bitstream and can itself be 
either non-scalable, or require the spatial or temporal scalability decoding process, and 
hence the decoding of additional bitstream, to be applied. The enhancement layer, 
derived from the second bitstream, contains mainly coded DCT coefficients and a small 
overhead. 

In the current MPEG-2 video coding standard, there is an SNR scalability 
extension that allows two levels of scalability. MPEG achieves high compression rate 
by storing only the changes from one frame to another, instead of each entire frame. 
There are at least two disadvantages of employing the MPEG-2 standard for encoding 
video data. One disadvantage is that the scalability granularity is not fine enough, 
because the MPEG-2 process is an all or none method. Either the receiving device can 
receive all of the data from the base layer and the enhancement layer or only the data 
from the base layer bitstream. Therefore, the granularity is not scalable. In a network 
environment, more than two levels of scalability are usually needed. 

Another disadvantage is that the enhancement layer coding in MPEG-2 is not 
efficient. Too many bits are needed in the enhancement layer in order to have a 
noticeable increase in video quality. 
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The present invention overcomes these disadvantages and others by providing, 
among other advantages, an efficient scalable video coding method with increased 
granularity. 

SUMMARY OF THE INVENTION 

The present invention can be characterized as a scalable video coding means 
and a system for encoding video data, such that quality of the final image is gradually 
improved as more bits are received. The improved quality and scalability are achieved 
by a method wherein an enhancement layer is subdivided into layers or levels of 
bitstream layers. Each bitstream layer is capable of carrying information 
complementary to the base layer information, in that as each of the enhancement layer 
bitstreams are added to the corresponding base layer bitstreams the quality of the 
resulting images are improved. 

The number N of enhancement layers is determined or limited by the network 
that provides the transmission channel to the destination point. While the base layer 
bitstream is always transmitted to the destination point, the same is not necessarily true 
for the enhancement layers. Each layer is given a priority coding and transmission is 
effectuated according to the priority coding. In the event that all of the enhancement 
layers cannot be transmitted the lower priority coded layers will be omitted. The 
omission of one or more enhancement layers may be due to a multitude of reasons. 

For instance, the server which provides the transmission channel to the 
destination point may be experiencing large demand on its resources from other users, 
in order to try and accommodate all of its users the server will prioritize the data and 
only transmit the higher priority coded packets of information. The transmission 
channel may be the limiting factor because of the bandwidth of the channel, i.e. 
Internet access port, Ethernet protocol, LAN, WAN, twisted pair cable, co-axial cable, 
etc. or the destination device itself, i.e. modem, absence of an enhanced video card, 
etc. may not be able to receive the additional bandwidth made available to it. In these 
instances only M number (M is an integer number = 0, 1, 2, . . .) of enhancement layers 
may be received, wherein N number (N is an integer number = 0, 1, 2, . . .) of 
enhancement layers were generated at the encoding stage, M < N. 
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To achieve these and other advantages and in accordance with the purpose of 
the present invention, as embodied and broadly described, the scalable video method 
and apparatus according to one aspect of the invention includes a video encoding 
method for adapting a video input to a bandwidth of a transmission channel of a 
5 network, the method includes determining the number N of enhancement layer 

bitstreams capable of being adapted to the bandwidth of the transmission channel of 
the network. Encoding a base layer bitstream from the video input is then performed 
and encoding N number of enhancement layer bitstreams from the video input based on 
the base layer bitstream, wherein the plurality of enhancement layer bitstreams 

10 complements the base layer bitstream. The base layer bitstream and the N 

enhancement layer bitstreams are then provided to the network. 

According to another aspect of the present invention, a video decoding method 
for adapting a video input to a bandwidth of a transmission channel of a network 
includes, determining number M of enhancement layer bitstreams of said video input 

15 capable of being received from said transmission channel of said network. Decoding a 

base layer bitstream from received video input and decoding M number of 
enhancement layer bitstreams from the received video input based on the base layer 
bitstream, wherein the M received enhancement layer bitstreams complements the base 
layer bitstream. Then reconstructing the base layer bitstream and N enhancement layer 

20 bitstreams. 

According to still another aspect of the present invention, a video decoding 
method for adapting a video input to a bandwidth of a receiving apparatus, the method 
includes demultiplexing a base layer bitstream and at least one of a plurality of 
enhancement layer bitstreams received from a network, decoding the base layer 

25 bitstream, decoding at least one of the plurality of enhancement layer bitstreams based 

on generated base layer bitstream, wherein the at least one of the plurality of 
enhancement layer bitstreams enhances the base layer bitstream. Then reconstructing a 
video output. 

According to a further aspect of the present invention, a video encoding 
30 method for encoding enhancement layers based on a base layer bitstream encoded from 

a video input, the video encoding method includes, taking a difference between an 
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original DCT coefficient and a reference point and dividing the difference between the 
original DCT coefficient and the reference point into N bit-planes. 

According to a still further aspect of the present invention, a method of coding 
motion vectors of a plurality of macroblocks, includes determining an average motion 
vector from N motion vectors for N macroblocks, utilizing the determined average 
motion vector as the motion vector for the N macroblocks, and encoding 1/N motion 
vectors in a base layer bitstream. 

Additional features and advantages of the invention will be set forth in the 
description which follows, and in part will be apparent from the description, or may be 
learned by practice of the invention. The aspects and other advantages of the invention 
will be realized and attained by the structure particularly pointed out in the written 
description and claims hereof as well as the appended drawings. 

It is to be understood that both the foregoing general description and the 
following detailed description are exemplary and explanatory and are intended to 
provide further explanation of the invention as claimed. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The accompanying drawings, which are included to provide a further 
understanding of the invention and are incorporated in and constitute a part of this 
specification, illustrate embodiments of the invention and together with the description 
serve to explain the principles of the invention. In the drawings: 

Fig. 1 illustrates a flow diagram of the scalable video encoding method of the 
present invention; 

Fig. 2 A illustrates conventional probability distribution of DCT coefficient 

values; 

Fig. 2B illustrates conventional probability distribution of DCT coefficient 
residues; 

Fig. 3 A illustrates the probability distribution of DCT coefficient values of the 
present invention; 

Fig. 3B illustrates the probability distribution of DCT coefficient residues of the 
present invention; 
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20 



25 



Figs. 3C and 3D illustrates a method for taking a difference of a DCT 
coefficient of the present invention; 

Fig. 5 illustrates a flow diagram for finding the maximum number of bit-planes 



Fig. 6 illustrates a flow diagram for generating (RUN, EOP) Symbols of the 
present invention; 

Fig. 7 Illustrates a flow diagram for encoding enhancement layers of the 
present invention; 

Fig. 8 illustrates a flow diagram for encoding (RUN, EOP) symbols and 
sign_enh values of one DCT block of one bit-plane; 

Fig. 9 illustrates a flow diagram for encoding a sign_enh value of the present 
invention; 

Fig. 10 illustrates a flow diagram for adding enhancement difference to a DCT 
coefficient of the present invention; 

Fig. 1 1 illustrates a flow diagram for converting enhancement difference to a 
DCT coefficient of the present invention; 

Fig. 12 illustrates a flow diagram for decoding enhancement layers of the 
present invention; 

Fig. 13 illustrates a flow diagram for decoding (RUN, EOP) symbols and 
sign_enh values of one DCT block of one bit-plane; 

Fig. 14 illustrates a flow diagram for decoding a sign_enh value; and 
Fig. 15 illustrates a prior a conventional SNR scalability flow diagram. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Reference will now be made in detail to the preferred embodiments of the 
present invention, examples of which are illustrated in the accompanying drawings. 

Fig. 1 illustrates the scalable video diagram 10 of an embodiment of the present 
invention. The original video input 20 is encoded by the base layer encoder 30 in 
accordance with the method of represent by flow diagram 400 of Fig. 4. A DCT 
coefficient OC and its corresponding base layer quantized DCT coefficient QC are 
generated and a difference determined pursuant to steps 420 and 430 of Fig. 4. The 



in the DCT differences of a frame of the present invention; 
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difference information from the base layer encoder 30 is passed to the enhancement 
layer encoder 40 that encodes the enhancement information. 

The encoding of the enhancement layer encoder is performed pursuant to 
methods 500 - 900 as depicted in Figs. 5-10, respectively and will be briefly 
described. The bitstream from the base layer encoder 30 and the N bitstreams from the 
enhancement layer encoder 40 are capable of being sent to the transmission channel 60 
by at least two methods. 

In the first method all bitstreams are multiplexed together by multiplexor 50 
with different priority identifiers, e.g., the base layer bitstream is guaranteed, 
enhancement bitstream layer 1 provided by enhancement layer encoder 40 is given a 
higher priority than enhancement bitstream layer 2. The prioritization is continued 
until all N (wherein N is an integer from 0, 1, 2, . . . ) of the bitstreams layers are 
prioritized. Logic in the encoding layers 30 or 40 in negotiation with the network and 
intermediated devices determine the number N of bitstream layers to be generated. 

The number of bitstream layers generated is a function of the total possible 
bandwidth of the transmission channel 60, i.e. Ethernet, LAN, or WAN connections 
(this list is not intended to exhaustive but only representation of potential limiting 
devices and/or equipment), and the network and other intermediate devices. The 
number of bitstream layers M (wherein M is an integer and M < N) reaching the 
destination point 100 can be further limited by not just the physical constraints of the 
intermediate devices but the congestion on the network, thereby necessitating the 
dropping of bitstream layers according to their priority. 

In a second method the server 50 knows the transmission channel 60 condition, 
i.e. congestion and other physical constraints, and selectively sends the bitstreams to 
the channel according to the priority identifiers. In either case, the destination point 
100 receives the bitstream for the base layer and M bitstreams for the enhancement 
layer, where M < N. 

The bitstreams M are sent to the base layer 90 and enhancement layer 80 
decoders after being demultiplexed by demultiplexer 70. The decoded enhancement 
information from the enhancement layer decoder is passed to the base layer decoder to 
composite the reconstructed video output 100. The decoding of the multiplexed 
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bitstreams are accomplished pursuant to the methods and algorithms depicted in flow 
diagrams 1 100 - 1400 of Figs. 11-14, respectively. 

The base layer encoder and decoder are capable of performing logic pursuant 
to the MPEG-1 , MPEG-2, or MPEG-4 (Version- 1) standards that are hereby 
5 incorporated by reference into this disclosure. 

Taking Residue with Probability Distribution Preserved 

A detailed description of the probability distribution residue will now be made 
with reference to Figs 2A - 3B 

10 In the current MPEG-2 signal-to-noise ratio (SNR) scalability extension, a 

residue or difference is taken between the original DCT coefficient and the quantized 
DCT coefficient. Fig. 2A illustrates the distribution of a residual signal as a DCT 
coefficient. In taking the residue small values have higher probabilities and large 
values have smaller probabilities. The intervals along the horizontal axis represent 

15 quantization bins. The dot in the center of each interval represents the quantized DCT 

coefficient. Taking the residue between the original and the quantized DCT coefficient 
is equivalent to moving the origin to the quantization point. 

Therefore, the probability distribution of the residue becomes that as shown in 
Figure 2B. The residue from the positive side of Fig. 2A has a higher probability of 

20 being negative than positive and the residue taken from the negative side of the Fig. 2A 

has a higher probability of being positive than negative. The result is that the 
probability distribution of the residue becomes almost uniform. Thus making coding 
the residue more difficult. 

A vastly superior method is to generate a difference between the original and 

25 the lower boundary points of the quantized interval as shown in Fig. 3 A and Fig. 3B. 

In this method, the residue is taken from the positive side of Fig. 2 A remains positive 
and the residue from the negative side of Fig. 2 A remains negative. Taking the residue 
is equivalent to moving the origin to the reference point as illustrated in Fig. 3 A. Thus, 
the probability of the residue becomes as shown in Fig. 3B. This method preserves the 

30 shape of the original non-uniform distribution. Although the dynamic range of the 

residue taken in such a manner seems to be twice of that depicted in Fig. 2B, their is 
no longer a need to code the sign, i.e. - or +, of the residue. The sign of the residue is 
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encoded in the base layer bitstream corresponding the enhancement layer, therefore 
this redundancy is eliminated and bits representing the sign are thus saved. Therefore, 
there is only a need to code the magnitude that still has a nonuniform distribution. 

Bit plane coding of residual DCT coefficients 

After taking residues of all the DCT coefficients in an 8 x 8 block, bit plane 
coding is used to code the residue. In bit-plane coding method the bit-plane coding 
method considers each residual DCT coefficient as a binary number of several bits 
instead of as a decimal integer of a certain value as in the run-level coding method. 
The bit-plane coding method in the present invention only replaces runlevel coding 
part. Therefore, all the other syntax elements remain the same. 

An example of and description of the bit-plane coding method will now be 
made, wherein 64 residual DCT coefficients for an Inter-block and 63 residual DCT 
coefficients for an Intra-block (excluding the Intra-DC component that is coded using 
a separate method) are utilized for the example. The 64 (or 63) residual DCT 
coefficients are ordered into a one-dimensional array and at least one of the residual 
coefficients is non-zero. The bit-plane coding method then performs the following 
steps. 

The maximum value of all the residual DCT coefficients in a frame is 
determined and the minimum number of bits, N, needed to represent the maximum 
value in the binary format is also determined. N is the number of bitplanes layers for 
this frame and is coded in the frame header. 

Within each 8x8 block is represent every one of the 64 (or 63) residual DCT 
coefficients with N bits in the binary format and there is formed N bit-planes or layers 
or levels. A bit-plane is defined as an array of 64 (or 63) bits, taken one from each 
residual DCT coefficient at the same significant position. 

The most significant bit-plane is determined with at least one non-zero bit and 
then the number of all-zero bit-planes between the most significant bit-plane 
determined and the Nth one is coded. Then starting from the most significant bit plane 
(MSB plane), 2-D symbols are formed of two components: (a) number of consecutive 
O's before a I (RUN), (b) whether there are any l's left on this bit plane, i.e. End-Of- 
Plane (EOP). If a bit-plane after the MSB plane contains all O's, a special symbol 
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ALL-ZERO is formed to represent an all-zero bit-plane. Note that the MSB plane 
does not have the all-zero case because any all-zero bit-planes before the MSB plane 
have been coded in the previous steps. 

Four 2-D VLC tables are used, wherein the table VT-C-Table-0 corresponds to 
the MSB plane; table VLC-Table- I corresponds to the second MSB plane; table VLC- 
Table-2 corresponds to the third MSB plane; and table VLC-Table-3 corresponds to 
the fourth MSB and all the lower bit planes. For the ESCAPE cases, RUN is coded 
with 6 bits, EOP is coded with 1 bit. Escape coding is a method to code very small 
probability events which are not in the coding tables individually. 

An example of the above process will now follow. For illustration purposes, 
we will assume that the residual values after the zigzag ordering are given as follows 
and N = 6: The following representation is thereby produced. 

10, 0, 6, 0, 0, 3, 0, 2, 2, 0, 0, 2, 0, 0, 1, 0, ... 0, 0 

The maximum value in this block is found to be 10 and the minimum number of 
bits to represent 10 in the binary format (1010) is 4. Therefore, two all-zero bit-planes 
before the MSB plane are coded with a code for the value 2 and the remaining 4 bit- 
planes are coded using the (RUN, EOP) codes. Writing every value in the binary 
format using 4 bits, the 4 bit-planes are formed as follows: 

1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,0, 0 (MSB-plane) 

0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,0, 0 (Second MSB-plane) 

1, 0, 1, 0, 0, 1, 0, 1, 1, 0, 0, 1, 0, 0, 0, 0,0, 0 (Third MSB-plane) 

0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 1 , 0,0, 0 (Fourth MSB-plane or LSB-plane) 

Converting the bits of each bit-plane into (RUN, EOP) symbols results in the 
following: 



(0,1) 
(2,1) 

(0, 0), (1,0), (2,0), (1,0), (0, 0), (2, 1) 



(MSB-plane) 
(Second MSB-plane) 
(Third MSB-plane) 
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(5,0), (8,1) 



(Fourth MSB-plane or LSB-plane) 



Therefore, there are 10 symbols to be coded using the (RUN, EOP) VLC 
tables. Based on their locations in the bit-planes, different VLC tables are used for the 
coding. The enhancement bitstream using all four bitplanes looks as follows: 
code leading-all-zero(2) 
code msb(0, 1) 
code msb- 1(2,1) 

code-msb-2(0,0), code__msb-2(l,0), code-msb-2(2,0), code-msb-2(l,0), code-msb- 
2(0,0), code-msb-2(2, 1) code_msb-3(5,0), code__msb-3(8, 1). 

In an alternative embodiment, several enhancement bitstreams may be formed 
from the four bit-planes, in this example from the respective sets comprising one or 
more of the four bit-planes. 

Motion Vector Sharing 

In this alternative embodiment of the present invention motion vector sharing is 
capable of being utilized when the base layer bitstream exceeds a predetermined size or 
more levels of scalability are needed for the enhancement layer. By lowering the 
number of bits required for coding the motion vectors in the base layer the bandwidth 
requirements of the base layer bitstream is reduced. In base layer coding, a 
macroblock (16 x 16 pixels for the luminance component and W pixels for each chron- 
luminance components) of the current frame is compared with the previous frame 
within a search range. The closest match in the previous frame is used as a prediction 
of the current macroblock. The relative displacement of the prediction to the current 
macroblock, in the horizontal and vertical directions, is called a motion vector. 

The difference between the current macroblock and it's prediction is coded 
using the DCT coding. In order for the decoder to reconstruct the current 
macroblock, the motion vector has to be coded in the bitstream. Since there is a fixed 
number of bits for coding a frame, the more bits spent on coding the motion vectors 
results in fewer bits for coding the motion compensated differences. Therefore, it is 
desirable to lower the number of bits for coding the motion vectors and leave more bits 
for coding the differences between the current macroblock and its prediction. 
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For each set of 2 x 2 motion vectors, the average motion vector can be 
determined and used for the four macroblocks. In order to not change the syntax of 
the base layer coding, four macroblocks are forced to have the identical motion 
vectors. Since only one out four motion vectors is coded in the bitstream, the amount 
of bits spent on motion vector coding is reduced, therefore, there are more bits 
available for coding the differences. The cost for pursuing such a method is that the 
four macroblocks, which share the same motion vector may, not get the best matched 
prediction individually and the motion compensated difference may have a larger 
dynamic range, thus necessitating more bits to code the motion vector. 

For a given fixed bitrate, the savings from coding one out of four motion 
vectors may not compensate the increased number of bits required to code the 
difference with a larger dynamic range. However, for a time varying bitrate, a wider 
dynamic range for the enhancement layer provides more flexibility to achieve the best 
possible usage of the available bandwidth. 

Coding Sign Bits 

In an alternative embodiment of the present invention, if the base layer 
quantized DCT coefficient is non-zero, the corresponding enhancement layer 
difference will have the same sign as the base layer quantized DCT. Therefore, there is 
no need to code the sign bit in the enhancement layer. 

Conversely, if the base layer quantized DCT coefficient is zero and 
corresponding enhancement layer difference is non-zero, a sign bit is placed into 
enhancement layer bitstream immediately after the MSB of the difference. 
An example of the above method will now follow. 

Difference of a DCT block after ordering 

- 10, 0, 6, 0, 0, 3, 0, 2, 2, 0, 0, 2, 0, 0, 1, 0, ...0, 0 

Sign indications of the DCT block after ordering 

- 3, 3, 3, 3, 2, 0, 3, 3, 1, 2, 2, 0, 3, 3, 1, 2, ... 2, 3 

- 0: base layer quantized DCT coefficient = 0 and difference >0 

- 1 : base layer quantized DCT coefficient = 0 and difference <0 

- 2: base layer quantized DCT coefficient = 0 and difference =0 
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- 3: base layer quantized DCT coefficient = 0. 

In this example, the sign bits associated with values 10, 6, 2 don't need to be 
coded and the sign bits associated with 3, 2, 2, 1 are coded in the following way: 
Code(All Zero) 
code (All Zero) 
code(0,l) 
code(2,l) 

code(0,0),code(l,0Xcode(2,0),0,code(l ? 0) 5 code(0,0) ? l,code(2,l),0 
code(5,0),code(8,l),l 

For every DCT difference, there is a sign indication associated with it. There 
are four possible cases. In the above coding 0, 1 , 2, and 3 are used to denote the four 
cases. If the sign indication is 2 or 3, the sign bit does not have to be coded because it 
is either associated with a zero difference or available from the corresponding base 
layer data. If the sign indication is 0 or 1 a sign bit code is required once per difference 
value, i.e. not every bit-plane of the difference value. Therefore, a sign bit is put 
immediately after the most significant bit of the difference. 

Optimal Reconstruction of the DCT Coefficients 

In an alternative embodiment of the present invention, even though N 
enhancement bitstream layers or planes may have been generated, only M, wherein M 
< N enhancement layer bits are available for reconstruction of the DCT coefficients 
due to the channel capacity, and other constraints such as congestion among others, 
the decoder 80 of Fig. 1 may receive no enhancement difference or only a partial 
enhancement difference. In such a case, the optimal reconstruction of the DCT 
coefficients is capable of proceeding along the following method: 

If decoded difference = 0, the reconstruction point is the same as that in base 
layer, otherwise, the reconstructed difference = decoded difference + l A 
*(l«decoded__bit_plane) and the reconstruction point = reference point + 
reconstructed difference * Q_enh +Q_enh/2. 

In the present embodiment, referring to Figs. 3C and 3D, the optimal 
reconstruction point is not the lower boundary of a quantization bin. The above 
method specifies how to obtain the optimal reconstruction point in cases where the 



-14- 



WO 00/05898^P W PCTAJS99/16638 

difference is quantized and received partially, i.e. not all of the enhancement layers 
generated are either transmitted or received as shown in Fig. 1 . wherein M < N. 
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What is claimed is: 

1 . A video encoding method for adapting a video input to a bandwidth of a 
transmission channel of a network, the method comprising the steps of: 

determining number N of enhancement layer bitstreams capable of being 

adapted to said bandwidth of said transmission channel of said network; 
encoding a base layer bitstream from said video input; 
encoding N number of enhancement layer bitstreams from said video 
input based on the base layer bitstream, wherein the 

N enhancement layer bitstreams complements the base layer bitstream; and 
providing the base layer bitstream and N enhancement layer bitstreams to said 
network. 

2. The video encoding method according to claim 1, wherein the 
determining step includes negotiating with intermediate devices on said 
network. 

3. The video encoding method according to claim 2, wherein 
negotiating includes determining destination resources. 

4. The video encoding method according to claim 1, wherein the step of 
encoding the base layer bitstreams is performed by a MPEG-1 encoding 
method. 

5. The video encoding method according to claim 1, wherein the step of 
encoding the base layer bitstreams is performed by a MPEG-2 encoding 
method. 

6. The video encoding method according to claim 1 , wherein the step of 
encoding the base layer bitstreams is performed by a MPEG-4 encoding 
method. 
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7. The video encoding method according to claim 1 , wherein the step of 
encoding the base layer bitstreams is performed by a Discrete Cosine 
Transform (DCT) method. 



8. The video encoding method according to claim 7, wherein after 

encoding the base layer bitstreams by a Discrete Cosine Transform (DCT) 
method a DCT coefficient is quantized. 



9. The video encoding method according to claim 1, wherein the enhancement 
layer bitstreams are based on the difference of an original base layer DCT 
coefficient and a corresponding base layer quantized DCT coefficient. 

10. The video encoding method according to claim 1 , wherein the base 
layer bitstream and the N enhancement layer provide to the network are 
multiplexed. 



11. A video decoding method for adapting a video input to a bandwidth of a 
transmission channel of a network, the method comprising the steps of: 

determining number M of enhancement layer bitstreams of said video input 
capable of being received from said transmission channel of said 
network; 

decoding a base layer bitstream from received video input; 

decoding M number of enhancement layer bitstreams from the received video 
input based on the base layer bitstream, wherein the M received 
enhancement layer bitstreams complements the base layer bitstream; 

and 

reconstructing the base layer bitstream and N enhancement layer bitstreams. 

12. The video decoding method according to claim 11, wherein the 
determining step includes negotiating with intermediate devices on said 
network. 
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13. The video decoding method according to claim 1 2, wherein 
negotiating includes determining destination resources. 

14. The video decoding method according to claim 11, wherein the step of 
decoding the base layer bitstreams is performed by a MPEG-1 decoding 
method. 

15. The video decoding method according to claim 11, wherein the step of 
decoding the base layer bitstreams is performed by a MPEG-2 decoding 
method. 

1 6. The video decoding method according to claim 1 1 , wherein the step of 
decoding the base layer bitstreams is performed by a MPEG-4 decoding 
method. 

17. The video decoding method according to claim 11, wherein the step of 
decoding the base layer bitstreams is performed by a Discrete Cosine 
Transform (DCT) method. 

18. The video decoding method according to claim 1 7, wherein after 
decoding the base layer bitstreams by a Discrete Cosine Transform (DCT) 
method a DCT coefficient is unquantized. 

19. The video decoding method according to claim 1 1 , wherein coding of the 
enhancement layer bitstreams are based on the difference of an original base 
layer DCT coefficient and a corresponding base layer quantized DCT 
coefficient. 

20. The video decoding method according to claim 1 1 , wherein the base 
layer bitstream and the M enhancement layers to be reconstructed are de- 
multiplexed. 
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21. A video decoding method for adapting a video input to a bandwidth of a 
receiving apparatus, the method comprising the steps of: 

demultiplexing a base layer bitstream and at least one of a plurality of 
enhancement layer bitstreams received from a network; 
decoding the base layer bitstream; 

decoding at least one of the plurality of enhancement layer bitstreams based 

on generated base layer bitstream, wherein the at least one of the plurality of 
enhancement layer bitstreams enhances the base layer bitstream; and 
reconstructing a video output. 

22. A video encoding method for encoding enhancement layers based on a base 
layer bitstream encoded from a video input, the video encoding method comprising the 
steps of: 

taking a difference between an original DCT coefficient and a reference point; 

and 

dividing the difference between the original DCT coefficient and the reference 
point into N bit-planes. 

23. The video encoding method according to claim 22, wherein RUN and EOP 
symbols represents the N bit-planes of a DCT block. 

24. The video encoding method according to claim 23, wherein the RUN and EOP 
symbols are encoded. 

25. The video encoding method according to claim 24, wherein a sign bit is 
encoded if the DCT difference is equal to zero or the sign of the DCT difference is the 
same as the corresponding base layer bitstream data. 
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26. A video decoding method for reconstructing DCT coefficients M enhancement 
layers of N enhancement layers have been received, wherein M < N, comprising: 

means for taking a reconstruction difference as a decoded difference and a 
portion of a decoded bit-plane; 

means for taking a reconstruction point as a reference point and a 
reconstructed difference; and 
determining an optimal reconstruction point. 

27. A method of coding motion vectors of a plurality of macroblocks, the method 
comprising the steps of: 

determining an average motion vector from N motion vectors for N 
macroblocks; 

utilizing the determined average motion vector as the motion vector for the N 

macroblocks; and 
encoding 1/N motion vectors in a base layer bitstream. 
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optimal point is AQC*(2*Q) + Q 



BNSDOCID: <WO 0005898A2J_> 



WO 00/05898 



PCT/US99/16638 




BNSDOCID: <WO 0005898A2_L> 



WO 00/05898 



PCT/US99/16638 



FIG. 6 



7/16 



Start 



600 



Input a block of DCT 
coefficient differences 



Input a bit-plane of the 
block of DCT coefficient 
differences 



Set RUN = 0 



input a bit of the 
bit-plane, b 



610 



620 



630 



_-_ — 




BNSDOCID: <WO 0005898A2_I_> 



WO 00/05898 



PCT/US99/16638 



8/16 



700 



FIG. 7 



Start 



Put max-bit-plane 
value into bit stream 
with 4 bits 



Input (RUN, EOP) symbols and 
sign-enh values of one bit-plane of a 
frame 



Input (RUN, EOP) symbols and 
sign-enh values of one DCT block of 
the bit-plane 



710 



720 



730 



*** SEE FLOW DIAGRAM 800 — 
Encode (RUN, EOP) symbols and Sign-enh 
values of one DCT block of the bit-plane. 




740 



BNSDOCID: <WO OO05898A2J_> 



WO 00/05898 



PCT/US99/16638 



9/16 



800 



Start 



810 



Yes 



Input one symbol 




820 



830 



Yes 



1 



Put code for the 
symbol into bitstream 



840 



Put escape code 
into bitstream 



Put 6 bits for RUN 
into bitstream 



i 



Put 1 bit for EOP 
into bitstream 



850 



860 



870 



Encode sign-enh value 
•SEE FLOW DIAGRAM 900* 



890 



880 



Put code for (All zero) 
into bitstream 




End 



FIG. 8 



BNSDOCID: <WO 0005898 A2 J _> 



WO 00/05898 



10/16 



PCT/US99/16638 



900 



Yes 




Put one bit of Sign-enh 
value into bitstream 






Set sign 


-enh = 3 


» 


f 


End 



920 



FIG. 9 



BNSDOCID: <WO 0005898A2_L> 



WO 00/05898 



PCT/US99/16638 



11/16 



1000 



Start 




I 



input a a iff value and a 
corresponaing base tayer 
quantified OCT coefficient QC 



Find absolute value and sign 
of QC: AQC and SQC 



1010 



1020 



1030 



Yes 




ref = tower boundary of quantization bin 
(not optimal reconstruction point) 



ref = 0 



get sign bit and 
assign it to 
SRC: 
if sign bit = 0. 

SRC = 1 
if sign bit = 1 . 
SRC = -1 



ARC = ref + diff 



* RC = SRC * ARC 



I 




1040 



RC is the constructed OCT 
coefficient. 

SRC is me sign of RC and 

ARC is tne absolute value 
of RC. 



FIG. 10 



BNSDOCID: <WO 000589BA2_I_> 



WO 00/05898 



12/16 



PCT/US99/16638 



Start 



1100 



Set diff = 0 



set bit-index = max_bit_plane -1 



get one bit b, from bit-plane 
buffer of bit-index 



diff ♦= b 


»2 bit tndex 






bitjndex — 1 



1110 



1120 



1130 



1140 



1150 




1160 



1170 




End 



FIG. 11 



BNSDOCID: <WO 0005898A2J_> 



WO 00/05898 



13/16 



PCT/US99/16638 



1200 



Start 



Get max_bit_plane value from 
bitstream by reading 4 bits 



"SEE FLOW DIAGRAM 1300** 
Decode (RUN, EOP) symbols and 
sign_enh values of one DCT block 
of one bit-plane 




1210 



1220 



1230 



FIG. 12 



BNSDOCID: <WO 0O05898 A2 J _> 



WO 00/05898 



14/16 



PCT/US99/16638 



1300 



Start 



Decode a symbol 
from bitstream 



1310 



1320 




Put RUN Os and a 
1 into bit-plane buffer 



Get EOP from the next 
1 bit in the bitstream 



SEE FLOW DIAGRAM 
1400 Decode sign_ enh value' 




Put Os to the bit- 
plane buffer until the 
end of the block 



FIG, 13 



End 



BNSDOCID: <WO O005898A2 I > 



WO 00/05898 



15/16 



PCT/US99/16638 



get one bit from the 
bitstream as sign bit 



Set sign-enh = 3 



End 



1400 




1410 



1420 



1430 



FIG. 14 



BNSDOCID: <WO O005898A2J_> 



WO 00/05898 



PCT/US99/16638 




WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 7 : 
H04N 7/30 



A3 



(II) International Publication Number: WO 00/05898 

(43) International Publication Date: 3 February 2000 (03.02.00) 



(21) International Application Number: PCT/US99/ 16638 

(22) International Filing Date: 21 July 1999 (21.07.99) 



(30) Priority Data: 

60/093,860 
09/169,829 



23 July 1998 (23.07.98) US 
1 1 October 1 998 ( 1 1 . 1 0.98) US 



(63) Related by Continuation (CON) or Continuation-in-Part 
(CIP) to Earlier Applications 

US 60/093,860 (CIP) 

Filed on 23 July 1998 (23.07.98) 

US 09/169,829 (CIP) 

Filed on 1 1 October 1998 (1 1.10.98) 



(71) Applicant (for all designated States except US): OPTI VISION, 

INC. [US/US]; 3450 Hillview Avenue, Palo Alto, CA 94304 
(US). 

(72) Inventor; and 

(75) Inventor/Applicant (for US only): LI, Weiping [US/US]; 159 
California Avenue, J 103, Palo Alto, CA 94306 (US). 

(74) Agent: DAVIS, Paul; Wilson Sonsini Goodrich & Rosati, 650 
Page Mill Road, Palo Alto, CA 94304-1050 (US). 



(81) Designated States: AE, AL, AM, AT, AU, AZ, BA, BB, BG, 
BR, BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, 
GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, 
KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, 
MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, 
SK, SL, TJ, TM, TR, TT, UA, UG, US, UZ, VN, YU, ZA, 
ZW, ARIPO patent (GH, GM, KE, LS, MW, SD, SL, SZ, 
UG, ZW), Eurasian patent (AM, AZ, BY, KG, KZ, MD, 
RU, TJ, TM), European patent (AT, BE, CH, CY, DE, DK, 
ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI 
patent (BF, BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR, 
NE, SN, TD, TG). 



Published 

With international search report. 

(88) Date of publication of the international search report: 

27 April 2000 (27.04.00) 



(54) Title: SCALABLE VIDEO CODING AND DECODING 



ORIGINAL 
VIDEO _ 
INPUT 
20 



RECONSTRUCTED 
VIDEO ^ 
OUTPUT 
100 



ENHANCEMENT LAYER 
ENCODER 


O N 
• 


40 


2^ 


t 


ENHANCEMENT 1 
BITSTREAM 


BASE LAYER 
ENCODER 
30 


BASE LAYER 
BITSTREAM 






ENHANCEMENT LAYER 
DECODER 
80 


* M 
o 


* 2 




ENHANCEMENT 1 
BITSTREAM 


BASE LAYER 
DECODER 
90 


BASE LAYER 
BITSTREAM 



MUX 
OR 
SERVER 
50 



DEMUX 
70 



CHANNEL 
60 



(57) Abstract 

A video encoding method and apparatus for adapting a video input to a bandwidth of a transmission channel of a network that 
includes determining the number N enhancement layer bitstreams capable of being adapted to the bandwidth of the transmission channel 
of a network. A base layer bitstream is encoded from the video input wherein a plurality of enhancement layer bitstreams are encoded 
from the video input. The enhancement layer bitstreams are based on the base layer bitstream, wherein the plurality of enhancement layer 
bitstreams complements the base layer bitstream and the base layer bitstream and N enhancement layer bitstreams are transmitted to the 
network. 



BNSDOCID: <WO 0005898A3 I > 




FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TC 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


CI 


Cdte d' Noire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







BNSDOCID: <WO 0005898A3J_> 



IN^*N 



ATIONAL SEARCH REPOR" 



Intr ionai Application No 

PCT/US 99/16638 



A. CLASSIFICATION OF SUBJECT MATTER 

PC 7 H04N7/3O 




IPC 7 H04N 



Documentation searched other than minimum 



Etectronio data base consulted during t 



documentation to the extent that such documents are included in me fields searched 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ° 



Citation of document, with indication, where appropriate, of the relevant passages 

EP 0 739 139 A (SUN MICROSYSTEMS INC) 
23 October 1996 (1996-10-23) 



page 2, 1 

page 3, 1 

page 4, 1 

page 8, 1 



ne 44 - line 55 
ne 27 - line 48 
ne 45 -page 5, line 10 
ne 3 - line 33 



page 9, line 18 - line 24 

GONZALES C ET AL: "FLEXIBLY SCALABLE 
DIGITAL VIDEO CODING" M „«„ 
SIGNAL PROCESSING. IMAGE COMMUNICATION, 
vol. 5, no- 1/02, 

1 February 1993 (1993-02-01), pages 5-20, 
XPO0O345610 

ISSN: 0923-5965 
page 11, paragraph 2 -page 16, paragraph 
3; figure 4 



|"Yj Further documents are listed in the continuation of box C. 

Special categories of cited documents : 

- document defining the general state of the art which is not 

considered to be of particular relevance 
■ earlier document but published on or after the international 
filing date 

document which may throw doubts on priority J^*)" 
which is cited to establish the publication date of another 
citation or other special reason (as specified} 
• document referring to an oral disclosure, use, exhibition or 
other means 

document published prior to the international filing date but 
later than the priority date claimed 

Date of the actual oomplebon of the international search 



-/-- 



Relevant to claim No. 



1-21 



4-10, 
14-20 



Patent family members are listed in annex. 



later dooument published after the international «mg date 
or priority date and not in conflict with the application our 
cited to understand the principle or theory underlying the 
invention 

■ document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document n taken atone 
• document of particular relevanoe; the claimed invention 
cannot be considered to involve an inventive «tepwhenthe 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person sKiueo 
in the art. 

•V document member of the same patent family 



5 November 1999 



Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patenttaan 2 
NL - 2280 HV Rqswifr 
Tel (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+21-70)340-3016 



2 6. 01. 00 



Authorized officer 



Raeymaekers, P 



Form PCT/tSA/210 (second theet) <Juty 1992) 

BNSDOCID: <WO 0O05898A3_l_> 



page 



1 of 2 



INTERNv^ONAL SEARCH REPORT A ^Application no 

|PCT/US 99/16638 


C<Contlnuation) DOCUMENTS CONSIDERED TO BE RELEVANT 




Category* 


Citation of dooument, with indioation, where appropriate, of the relevant passages 




A 


DE 195 31 004 A (IBM) 
27 February 1997 (1997-02-27) 
column 3, line 6 -column b, line hl, 
figure 3 


1-21 


A 


WO 98 27733 A (8X8 INC) 

25 June 1998 (1998-06-25) 

page 9, line 10 -page 11, line 15 


9 


A 


EP 0 771 119 A (TOKYO SHIBAURA ELECTRIC 
CO) 2 May 1997 (1997-05-02) 
page 6, line 53 -page 7, line 21 


9 



Form PCT/tSA/^IO (oorrtwuaiian of second *haet) (July 1992) 



BNSDOCID; <WO 000589BA3J_> 

page 



INTERNATIONAL SEARCH REPORT 



International application No. 

PCT/US 99/16638 



Box I Observations 



where certain claims were found unsearchable (Continuation of Item 1 of first sheet) 



This International Search Report has not been established in respect o. certain claims under Article 17(2)(a) for the following reasons: 
1 ' O Sse fhey relate to subject matter not required to be searched by this Authority, namely: 



2 □ g£ETS!ei relate to parts of the International Application that do not comply with the prescribed requirements to such 
SSS no meaningful International Search can be carried out. specially: 



3. 1 1 Claims Nos.: 

because they are 



dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box 



II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



1 . rn As all required additional search fees were timely paid by the applicant, this International Search Report covers all 

I— I searchable claims. 

2. □ As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 



3 |— | As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
3 LJ ^vera only^thoseclaimt for which fees were paid, specifically clatms Nos.: 



4 [71 No required additional search fees were timely paid by the applicant Consequently this International Search Report is 
^ restricted to the invention first mentioned in the claims; rt ,s covered by claims Nos.. 



1-21 



Remark on Protest 



| | The additional search fees were accompanied by the applicant's protest. 
| | No protest accompanied the payment of additional search fees. 



Form PCT/ISA/21 0 (continuation of first sheet (1 )) (July 1 998) 

BNSDOCID: <WO 000S898A3_I_> 



INTERNATIONAL SEARCH REPORT 



International Application No. PCT/US 99/16638 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



1. Claims: 1-21 



Hierarchical DCT coder and respective decoder with number of 
enhancement levels determined on the basis of the 
transmission channel bandwidth 



2. Claims: 22-26 



hierarchical coder and decoder with bit planes as 
enhancement layers 



3. Claim : 27 

determining motion vectors in a hierarchical coder 



BNSDOCID: <WO 0005898A3J_> 



ATIONAL SEARCH REPORT ( 

InScrmsttcn on patent family members 



Inl tlonai Application No 

PCT/US 99/16638 



Patent document 
cited in search report 



Publication 
date 



Patent f amity 
members) 



EP 0739139 



23-10-1996 



US 
JP 



5742892 A 
9139940 A 



Publication 
date 



21-04-1998 
27-05-1997 



DE 19531004 


A 


27-02-1997 


US 


5914751 A 




W0 9827733 


A 


25-06-1998 


NONE 






EP 0771119 


A 


02-05-1997 


JP 
US 


9182084 A 
5818531 A 


06-10-1998 



BNSDOCID: <WO. 



Fatm PCT/lSAdO (patent famtty «iuwi) 1992) 

.0005898A3_I_> 



ft 



9 



